Nonuniform Oversampled Filter Banks for Audio Signal Processing

Background of the Invention 

This application claims priority under application number 60/243,811, filed
on October 27, 2000.

This invention relates to filters and, more particularly, to filter banks for 
audio applications. 

In many areas of audio processing, it is desirable to analyze an audio signal
in approximately the same time-frequency form as the human ear (i.e., with
bandwidths on the order of one Bark) and with a time resolution that is
commensurate with the bandwidth of the filter. In addition, it is desirable to
process the signals in each of the bands and then reconstruct them in a manner
such that when the bands are unmodified, the filter bank has a nearly perfect
reconstruction characteristic. Because the signals might be modified, and different
bands might be routed to different devices, not only must the filters provide
approximately exact reconstruction, they must also prevent aliasing due to the
unequal processing, or modification, of adjacent frequency bands. Hence, an
oversampled filter bank is required where aliasing introduced due to unequal
processing of bands is below the level of human hearing.

One application for this kind of filter bank is found in the problem of
separating an individual audio signal into its direct and indirect parts for
the purpose of rerouting, in real time, the direct and indirect signals to drivers that
reproduce them appropriately. In such an application, a filter bank that
approximates the critical bandwidths is essential to identifying the part of a signal
with direct cues, and the ability of the reconstruction filter bank to prevent
substantial aliasing when adjacent bands are added incoherently (as opposed to
coherently) is also an absolute requirement. Hence the need for an oversampled
critical band filter bank. In applications that require nonuniform division of the signal
spectrum, iterated cascades of uniform filter banks are often used. Iterated filter
banks, however, impose considerable structure on the equivalent filters, which
results in inferior time-frequency localization compared to direct designs. A study
of critically sampled nonuniform filter banks has been reported by J. Princen in
"The Design of Nonuniform Filter Banks," IEEE Transactions on Signal
Processing, Vol. 43, No. 11, pp. 2550-2560, Nov. 1995. Nonuniform filter banks
studied by Princen are obtained by joining pseudo QMF filter bank sections that
are nearly perfect reconstruction filter banks based on cosine modulation and the
principle of adjacent channel aliasing cancellation. R. Bernardini et al. published
"Arbitrary Tilings of the Time-Frequency Plane using Local Bases," IEEE
Transactions on Signal Processing, Vol. 47, No. 8, pp. 2293-2304, August 1999,
wherein they describe a cosine-modulation-based structure that allows for
time-adaptive nonuniform tiling of the time-frequency plane. Despite their many fine
features that are relevant to coding purposes, however, these approaches do not
have good aliasing attenuation properties in each of the subbands independently.
This makes them unsuitable for tasks where processing effects need to be
contained within the bands directly affected. Perfect, or nearly perfect,
reconstruction properties of these filter banks in the presence of upsampling are
also not clear. The pseudo QMF bank, for instance, loses its aliasing cancellation
property if the subband components are not critically downsampled.

Oversampled uniform filter banks based on cosine modulation were studied
by Bölcskei et al., and reported in "Oversampled Cosine Modulated Filter Banks
with Perfect Reconstruction," IEEE Transactions on Circuits and Systems II, Vol. 45,
No. 8, pp. 1057-1071, August 1998, but the cosine modulation places stringent
aliasing attenuation requirements.

Summary of the Invention

An advance in the art is attained with non-uniform filter banks created by 
joining sections of oversampled uniform filter banks that are based on complex 
exponential modulation (as opposed to cosine modulation). Each filter bank 
handles a given, non-overlapping frequency band, and the filters of different banks 
have different bandwidths. The frequency bands of the different filter banks cover
the frequency range of interest with gaps in the neighborhoods of the filter band edges.
A set of transition filters fills those gaps.

Brief Description of the Drawing 

FIG. 1 shows an analysis filter bank stage, followed by a signal processing
stage and a synthesis filter bank stage;

FIG. 2 shows the FIG. 1 arrangement, modified with a subsampling element
interposed between the output of each analysis filter bank and the signal
processing stage, and an upsampling element interposed between the signal
processing stage and each synthesis filter bank;

FIG. 3 shows the frequency response of the filter arrangement disclosed 
herein; and 

FIG. 4 presents a block diagram of the filter arrangement disclosed herein. 

Detailed Description

In accordance with the principles disclosed herein, nonuniform,
oversampled filter banks are obtained by joining sections of different uniform filter
banks with the aid of transition filters. The uniform filter banks that are used are
nearly perfect-reconstruction, oversampled, modulated filter banks.

FIG. 1 shows an analysis filter bank 100, followed by a processing stage
200, and a synthesis filter bank 300. Analysis filter bank 100 is shown with N filter
elements (of which elements 11, 12, 13 are explicitly shown), where each is a
modulated version of a given window function, $v[n]$. That is, the time response of
the i-th filter in the bank is:

$$h_i[n] = v[n]\,e^{j\frac{2\pi}{N}(i-1)n}, \qquad i = 1, 2, \ldots, N. \tag{1}$$

The term $e^{j\frac{2\pi}{N}(i-1)n}$ creates filters that are offset in frequency from one another,
through a complex modulation of the fundamental filter function $v[n]$, and some
artisans call these filters "modulates" of the filter $v[n]$.
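As a minimal sketch of equation (1), the following Python fragment builds the N modulates of a window and confirms that each one has its peak response at 2π(i-1)/N. The flat four-tap window and the helper names `modulated_filters` and `dtft` are illustrative assumptions, not part of the disclosure.

```python
import cmath
import math

def modulated_filters(v, N):
    """Return h_i[n] = v[n] * exp(j*2*pi*(i-1)*n/N) for i = 1..N (equation (1))."""
    return [
        [v[n] * cmath.exp(2j * math.pi * (i - 1) * n / N) for n in range(len(v))]
        for i in range(1, N + 1)
    ]

def dtft(h, omega):
    """Evaluate H(e^{j*omega}) = sum_n h[n] * exp(-j*omega*n)."""
    return sum(h[n] * cmath.exp(-1j * omega * n) for n in range(len(h)))

N = 4
v = [0.5, 0.5, 0.5, 0.5]          # illustrative window (assumption, not from the source)
bank = modulated_filters(v, N)

# Each modulate is the window response shifted to the center frequency 2*pi*(i-1)/N.
centers = [2 * math.pi * (i - 1) / N for i in range(1, N + 1)]
peak_gains = [abs(dtft(bank[i], centers[i])) for i in range(N)]
```

With this window every entry of `peak_gains` equals V(1) = 2, which is the window's response at zero frequency moved to each channel's center frequency.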

It can be shown that, when there is no subsampling in the channels of such
a filter bank (and, indeed, analysis filter bank 100 shows no subsampling), a necessary
and sufficient condition for perfect and numerically stable reconstruction of any
signal, $x$, from its subband components $y_i$, where

$$y_i[n] = \sum_{k} h_i[k]\,x[n-k], \qquad i = 1, 2, \ldots, N, \tag{2}$$

is given by

$$0 < A \le \sum_{i=0}^{N-1} \left| V\!\left(e^{j\left(\omega - \frac{2\pi}{N} i\right)}\right) \right|^2 \le B < \infty, \qquad \omega \in (-\pi, \pi). \tag{3}$$



In accord with the principles disclosed herein, however, a subclass of
windows is considered that satisfy the equation (3) condition with the caveat that
A=B=N, which reduces equation (3) to:

$$\sum_{i=0}^{N-1} \left| V\!\left(e^{j\left(\omega - \frac{2\pi}{N} i\right)}\right) \right|^2 = N \quad \text{for all } \omega \in (-\pi, \pi). \tag{4}$$

The power complementarity condition of equation (4) permits good control over the
effects of processing in the subband domain, and also assures stability.
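A small numerical check can make the difference between equations (3) and (4) concrete. The sketch below evaluates the sum of squared shifted responses for two illustrative windows that are assumptions of this example: a unit-energy flat window of length N, which satisfies equation (4), and a unit-energy ramp of length 2N, which satisfies only the looser bounds of equation (3).

```python
import cmath
import math

def dtft(v, omega):
    """Evaluate V(e^{j*omega}) for a finite window v."""
    return sum(v[n] * cmath.exp(-1j * omega * n) for n in range(len(v)))

def power_sum(v, N, omega):
    """Left-hand side of equation (4): sum over i of |V(e^{j(omega - 2*pi*i/N)})|^2."""
    return sum(abs(dtft(v, omega - 2 * math.pi * i / N)) ** 2 for i in range(N))

N = 8
rect = [1 / math.sqrt(N)] * N                 # unit-energy, length N (assumed example)
ramp = [n + 1.0 for n in range(2 * N)]        # length 2N, then rescaled to unit energy
ramp_norm = math.sqrt(sum(t * t for t in ramp))
ramp = [t / ramp_norm for t in ramp]

omegas = [-math.pi + 2 * math.pi * m / 64 for m in range(64)]
rect_sums = [power_sum(rect, N, w) for w in omegas]
ramp_sums = [power_sum(ramp, N, w) for w in omegas]

# For the ramp, the extremes over the grid play the role of A and B in equation (3).
A_ramp, B_ramp = min(ramp_sums), max(ramp_sums)
```

The flat window gives a sum equal to N at every frequency, whereas the ramp gives a sum that ripples between `A_ramp` and `B_ramp`.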

When the analysis filters satisfy this condition, the norm of the input signal
is related to the norm of the corresponding subband components $y_i$ as follows:

$$\|x\|^2 = \frac{1}{N} \sum_{i=1}^{N} \|y_i\|^2. \tag{5}$$

It is noted that the considered signals belong to the space of square summable
sequences, so the norm of $x$ corresponds to

$$\|x\| = \left( \sum_{n} |x[n]|^2 \right)^{1/2}.$$



If some processing modifies the subband components $y_i$ to $y_i + e_i$ (that is,
the input signals to filter bank 300 in FIG. 1 are the signals $y_i + e_i$), the total
distortion $e_x$ in the signal $x$ that is synthesized from the modified subband
components is bounded by

$$\|e_x\|^2 \le \frac{1}{N} \sum_{i=1}^{N} \|e_i\|^2. \tag{6}$$
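The norm relation of equation (5) and the distortion bound of equation (6) can be checked numerically. The following sketch makes several assumptions for illustration: a flat unit-energy window of length N, synthesis by time-reversed and conjugated analysis filters with an overall 1/N scaling, and an arbitrary small perturbation of the subbands.

```python
import cmath
import math

def convolve(a, b):
    """Full linear convolution of two finite sequences."""
    out = [0j] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for k, bk in enumerate(b):
            out[i + k] += ai * bk
    return out

def energy(s):
    """Squared norm of a finite sequence."""
    return sum(abs(t) ** 2 for t in s)

N = 4
v = [1 / math.sqrt(N)] * N        # assumed window; it satisfies equation (4)
h = [[v[n] * cmath.exp(2j * math.pi * i * n / N) for n in range(N)] for i in range(N)]
# Time-reversed, conjugated synthesis filters; the reversal adds a delay of N - 1 samples.
g = [[z.conjugate() for z in reversed(hi)] for hi in h]

def synthesize(subbands):
    """Filter each subband with its synthesis filter, sum, and scale by 1/N (assumed scaling)."""
    parts = [convolve(gi, si) for gi, si in zip(g, subbands)]
    return [sum(p[n] for p in parts) / N for n in range(len(parts[0]))]

x = [1.0, -2.0, 0.5, 3.0, 0.0, -1.5]
y = [convolve(hi, x) for hi in h]                  # subband components, no subsampling
x_rec = synthesize(y)[N - 1 : N - 1 + len(x)]      # remove the N - 1 sample delay

# Perturb every subband by a small arbitrary error e_i and measure the output distortion.
e = [[0.01 * ((7 * i + 3 * n) % 5 - 2) for n in range(len(y[0]))] for i in range(N)]
y_mod = [[a + b for a, b in zip(yi, ei)] for yi, ei in zip(y, e)]
e_x_energy = energy([a - b for a, b in zip(synthesize(y_mod), synthesize(y))])
bound = sum(energy(ei) for ei in e) / N            # right-hand side of equation (6)
```

In this sketch the subband energies add up to N times the input energy, `x_rec` matches `x`, and `e_x_energy` does not exceed `bound`.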

On the other hand, if the window $v$ satisfies the looser form of the perfect
reconstruction condition given by equation (3), the distortions in the subband
components and the distortion in the synthesized signal are related as

$$\frac{1}{B} \sum_{i=1}^{N} \|e_i\|^2 \le \|e_x\|^2 \le \frac{1}{A} \sum_{i=1}^{N} \|e_i\|^2. \tag{7}$$



Equation (7) indicates that the distortion in the synthesized signal may grow
considerably out of proportion when A is small. Thus, the distortion limit of
equation (6) is one advantage that arises from adopting the power
complementarity condition of equation (4). Another advantage that arises from
adopting the power complementarity condition of equation (4) is that an input
signal can be perfectly reconstructed using a synthesis filter bank that consists of
filters $\tilde{V}\!\left(e^{j\left(\omega - \frac{2\pi}{N} i\right)}\right)$, which are time-reversed versions of the analysis filters
$V\!\left(e^{j\left(\omega - \frac{2\pi}{N} i\right)}\right)$. It is such filters that are depicted in filter bank 300 of FIG. 1 (with
designation $\tilde{V}(z)$ corresponding to the time-reversed version of filter $V(z)$). That
provides the convenience of not having to deal with the design of a synthesis
window. Thus, the synthesis filter bank shown in FIG. 1 depicts filters 31, 32, and 33
that are the time-reversed versions of corresponding analysis filters 11, 12, and
13. It may be noted that the power complementarity condition of equation (4) also
holds for the synthesis filters; i.e.,

$$\sum_{i=0}^{N-1} \left| \tilde{V}\!\left(e^{j\left(\omega - \frac{2\pi}{N} i\right)}\right) \right|^2 = N \quad \text{for all } \omega \in (-\pi, \pi). \tag{8}$$

FIG. 2 shows an analysis filter bank where, following each of the filters
there is an associated subsampling circuit. That is, circuits 21, 22, and 23 follow
filters 11, 12, and 13, respectively. Correspondingly, in the synthesis filter bank
there are upsampling circuits 24, 25 and 26 that respectively precede filters 31, 32,
and 33. In the case of the FIG. 2 arrangement where there is subsampling by K in
the analysis channels, and the input signal is reconstructed from the subband
components using time-reversed versions of the analysis filters preceded by
K-factor upsampling, the reconstructed signal, in the Fourier domain, is given by

$$\hat{X}(e^{j\omega}) = \frac{N}{K}\,X(e^{j\omega}) + \sum_{k=1}^{K-1} A_k(e^{j\omega})\,X\!\left(e^{j\left(\omega - \frac{2\pi}{K} k\right)}\right), \tag{9}$$

where $A_k(e^{j\omega})$ are aliasing components

$$A_k(e^{j\omega}) = \frac{1}{K} \sum_{i=0}^{N-1} \tilde{V}\!\left(e^{j\left(\omega - \frac{2\pi}{N} i\right)}\right) V\!\left(e^{j\left(\omega - \frac{2\pi}{N} i - \frac{2\pi}{K} k\right)}\right).$$

Based on M. R. Portnoff, "Time-Frequency Representation of Digital Signals and
Systems Based on Short-Time Fourier Analysis," IEEE Transactions on Acoustics,
Speech and Signal Processing, Vol. 28, No. 1, pp. 55-69, February 1980; and Z.
Cvetkovic, "On Discrete Short-Time Fourier Analysis," IEEE Transactions on
Signal Processing, Vol. 48, No. 9, pp. 2628-2640, September 2000, it can be
shown that the aliasing components reduce to zero, and a perfect reconstruction
condition is attained, if the window satisfies the constraint:

$$\sum_{j} v[k + jK]\,v[k + iN + jK] = \frac{1}{K}\,\delta[i], \qquad k = 0, 1, \ldots, K-1, \tag{10}$$

where

$$\delta[i] = \begin{cases} 1 & \text{when } i = 0 \\ 0 & \text{otherwise.} \end{cases} \tag{11}$$
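The constraint of equation (10) can be tested directly on a candidate window, and its effect can be observed by running a signal through the FIG. 2 chain. The sketch below assumes a flat unit-energy window of length N with K dividing N (an illustrative choice that satisfies equation (10) exactly); the helper names are hypothetical. Under the scaling of equation (10), this sketch returns the input multiplied by the oversampling ratio N/K.

```python
import cmath
import math

def eq10_residual(v, N, K):
    """Largest deviation of a real window v from equation (10), over all k and i."""
    L = len(v)

    def at(n):
        return v[n] if 0 <= n < L else 0.0

    worst = 0.0
    for k in range(K):
        for i in range(-(L // N) - 1, L // N + 2):
            lhs = sum(at(k + j * K) * at(k + i * N + j * K) for j in range(L // K + 1))
            target = 1.0 / K if i == 0 else 0.0
            worst = max(worst, abs(lhs - target))
    return worst

def convolve(a, b):
    """Full linear convolution of two finite sequences."""
    out = [0j] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for k, bk in enumerate(b):
            out[i + k] += ai * bk
    return out

def fig2_chain(x, v, N, K):
    """Analysis, K-fold subsampling, K-fold upsampling, time-reversed synthesis."""
    L = len(v)
    out = None
    for i in range(N):
        h = [v[n] * cmath.exp(2j * math.pi * i * n / N) for n in range(L)]
        g = [z.conjugate() for z in reversed(h)]
        y = convolve(h, x)
        # Keeping every K-th sample and zero-stuffing models subsampling followed by upsampling.
        y_up = [t if n % K == 0 else 0j for n, t in enumerate(y)]
        part = convolve(g, y_up)
        out = part if out is None else [a + b for a, b in zip(out, part)]
    return out[L - 1 : L - 1 + len(x)]          # remove the L - 1 sample delay

N, K = 8, 4
rect = [1 / math.sqrt(N)] * N                   # assumed window satisfying equation (10)
ramp = [n + 1.0 for n in range(N)]
ramp_norm = math.sqrt(sum(t * t for t in ramp))
ramp = [t / ramp_norm for t in ramp]            # unit energy, but violates equation (10)

x = [1.0, -2.0, 0.5, 3.0, 0.0, -1.5, 2.0, 0.25, -0.75, 1.0]
x_out = fig2_chain(x, rect, N, K)
```

A flat window is used only because it makes the constraint easy to verify by hand; it has poor frequency selectivity and is not a design recommended by the text.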

As indicated above, the windows that are considered herein are those that
satisfy the power complementarity condition in equation (4), and provide a nearly
perfect reconstruction by having high enough attenuation for $\omega > 2\pi/K$. That is,
there is no significant aliasing contribution due to subsampling; i.e.,

$$V(e^{j\omega})\,V\!\left(e^{j\left(\omega - \frac{2\pi}{K}\right)}\right) \approx 0 \quad \text{for all } \omega. \tag{12}$$

This makes aliasing sufficiently low in each subband independently.
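One way to quantify how well a window meets equation (12) is to measure the peak of the product over a frequency grid. The sketch below does this for two illustrative 64-tap windows (a flat window and a raised-cosine window), neither of which is taken from the source or claimed to satisfy equation (4); the function name `alias_product_db` and the choice K = 8 are assumptions.

```python
import cmath
import math

def dtft_mag(v, omega):
    """Magnitude of V(e^{j*omega}) for a finite window v."""
    return abs(sum(v[n] * cmath.exp(-1j * omega * n) for n in range(len(v))))

def alias_product_db(v, K, grid=512):
    """Peak of |V(e^{jw}) V(e^{j(w - 2*pi/K)})| over a grid, in dB relative to |V(1)|^2."""
    ref = dtft_mag(v, 0.0) ** 2
    peak = max(
        dtft_mag(v, w) * dtft_mag(v, w - 2 * math.pi / K)
        for w in (2 * math.pi * m / grid for m in range(grid))
    )
    return 20 * math.log10(max(peak / ref, 1e-300))

L, K = 64, 8
rect = [1.0] * L
hann = [0.5 - 0.5 * math.cos(2 * math.pi * (n + 0.5) / L) for n in range(L)]

rect_db = alias_product_db(rect, K)
hann_db = alias_product_db(hann, K)
```

The smoother window yields a much lower product, which illustrates why attenuation beyond 2π/K matters for keeping aliasing low in each subband.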

In order to facilitate design of transition filters, it is convenient to deal with
windows that also have sufficiently high attenuation for frequencies $\omega > 2\pi/N$.
Sufficiently high attenuation for that purpose means that in the power
complementarity formula of equation (4), for any $2\pi i/N < \omega < 2\pi(i+1)/N$, the only
significant contribution to equation (4) comes from a filter having a center
frequency at $2\pi i/N$ and the filter having a center frequency at $2\pi(i+1)/N$; the
contribution of other filters is negligible. Stated in other words, a filter spills energy
into the band of no other filters except, possibly, the immediately adjacent filters.
Stated in still other words, the attenuation of a filter centered at $2\pi i/N$ at
frequencies $\omega < 2\pi(i-1)/N$ and $\omega > 2\pi(i+1)/N$ is greater than a selected value.
The above addresses uniform filters, with subsampling of K. One aspect of
the arrangement disclosed herein, however, is the use of filters that are not
necessarily of uniform bandwidth, where a signal can be analyzed with a time
resolution that is commensurate with the bandwidth of the filters, and have these
filters be such that a nearly perfect reconstruction of the signal is possible.
Consider, therefore, a first filter section, such as section 100 in FIG. 2, with
subsampling of $K_1$, and a window function $v_1$ (with filters $V_1\!\left(e^{j\left(\omega - \frac{2\pi}{N_1} i\right)}\right)$), and a second filter
section, such as section 100 in FIG. 2, with subsampling of $K_2$, and a window
function $v_2$ (with filters $V_2\!\left(e^{j\left(\omega - \frac{2\pi}{N_2} i\right)}\right)$). Consider further that window $v_1$ filters are employed
below the "break-over" frequency $\omega_0$ and window $v_2$ filters are employed above $\omega_0$.
Consequently, there is a gap in the frequency response function, as shown by the
filters on lines 10 and 30 in FIG. 3 (where, illustratively, $N_1 = 16$ and $N_2 = 12$) and
the resulting gap region 40. This gap is filled with a transition analysis filter
$V_{1,2}\!\left(e^{j(\omega + \omega_0)}\right)$ having a window $v_{1,2}$.
The shape of window $v_{1,2}$ is designed to provide for the aforementioned
near perfect reconstruction of a signal (in the absence of processing between the
analysis filters and the synthesis filters). When the transition analysis filter is
chosen to be subsampled at rate $K_2$ (and the transition synthesis filter upsampled
at rate $K_2$) the above constraint also means that the shape of window $v_{1,2}$ satisfies
the expression

(13)

where R is a constant. This can be achieved by selecting the $v_{1,2}$ window to have
the shape of the $v_2$ at positive frequencies, and the shape of the $v_1$ at negative
frequencies, but the latter being scaled by $\sqrt{N_1/N_2}$. That is the arrangement
shown on line 30 of FIG. 3. The physical filter arrangement is shown in FIG. 4,
with filter bank 150 providing the response of line 10 in FIG. 3, filter bank 160
providing the response of line 30 in FIG. 3, and filter 170 providing the response of
line 20 in FIG. 3.

In a relatively simple embodiment, the subsampling factors ($K_1$, $K_2$) and
window bandwidths ($2\pi/N_1$, $2\pi/N_2$) are selected so that $N_1/K_1 = N_2/K_2 = R$, and
the "break-over" frequency, $\omega_0$, is chosen to satisfy the condition

$$\frac{2\pi n_1}{N_1} = \frac{2\pi n_2}{N_2} = \omega_0, \tag{14}$$

where $n_1$ and $n_2$ are integers. In FIG. 3, for example, $N_1 = 16$, $N_2 = 12$, $n_1 = 4$ and
$n_2 = 3$. Filter bank 150, in accordance with the above, includes filters $V_1\!\left(z e^{-j2\pi i/N_1}\right)$,
where $i = 0, 1, 2, \ldots, n_1 - 1, \ldots, N_1 - n_1 + 1, N_1 - n_1 + 2, \ldots, N_1 - 1$, and filter bank 170 includes
filters $V_2\!\left(z e^{-j2\pi i/N_2}\right)$, where $i = n_2 + 1, n_2 + 2, \ldots, N_2 - n_2 - 1$.

Based on the above, one can conclude that in order to have nearly perfect
reconstruction in a nonuniform filter bank that comprises uniform filter bank
sections, the window of each of the uniform filter banks should have sufficiently
high attenuation for $\omega$ greater than $2\pi$ divided by the respective value of K of the
filter, and also sufficiently high attenuation for $\omega$ greater than $2\pi$ divided by the
respective value of N of the filter. The attenuation for $\omega > 2\pi/K$ controls aliasing,
while the high attenuation for $\omega > 2\pi/N$ facilitates design of transition filters for
attaining nearly perfect reconstruction with the nonuniform filter bank. Roughly
speaking, the attenuation at $\omega = 2\pi/N$, and twice that attenuation at $\omega = 2\pi/K$,
expressed in decibels, have a comparable effect on the error in the approximation
of equation (13).

To design a window that has a high attenuation in the band $\pi/N$ to $\pi$, one
can simply impose the requirement that the integral of the energy in that band
should be minimized. As indicated above, some spilling of energy close to $\pi/N$ is
permissible, but it is considered important that the attenuation that is farther
removed from $\pi/N$ should be large. To achieve this result, it is suggested that
the function to be minimized might be one that accentuates higher frequencies.
For example, one might choose to minimize the weighted integral

$$E_N = \int_{\pi/N}^{\pi} \omega^3 \left| V(e^{j\omega}) \right|^2 d\omega. \tag{15}$$
This energy function is given by the quadratic form

$$E_N = v^T T v, \tag{16}$$

where $v$ is the window (time) function in the form of a column vector and the matrix
$T$ comprises the elements:

$$[T]_{i,j} = \int_{\pi/N}^{\pi} \omega^3 \cos((i - j)\omega)\, d\omega. \tag{17}$$

The window design requires minimization of the quadratic form in equation (16)
under the power complementarity condition which, expressed in the time domain,
takes the form of the following set of quadratic constraints:

$$\sum_{n} v[n]\,v[n + iN] = \delta[i]. \tag{18}$$
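The quadratic form of equations (16) and (17) and the constraints of equation (18) can be evaluated with a few lines of code. In the sketch below, the entries of T use the closed-form antiderivative of ω³cos(mω), and the result is cross-checked against a direct Simpson integration of ω³|V|² over the band from π/N to π. The sine-shaped test window and all function names are assumptions made for illustration; the code only evaluates the cost and constraints and does not perform the constrained minimization.

```python
import math

def t_entry(m, lo, hi):
    """Integral of w**3 * cos(m*w) over [lo, hi]; an element of T with m = i - j."""
    def F(w):
        if m == 0:
            return w ** 4 / 4
        return (w ** 3 * math.sin(m * w) / m + 3 * w ** 2 * math.cos(m * w) / m ** 2
                - 6 * w * math.sin(m * w) / m ** 3 - 6 * math.cos(m * w) / m ** 4)
    return F(hi) - F(lo)

def stopband_energy(v, N):
    """Quadratic form E_N = v^T T v of equation (16)."""
    L = len(v)
    lo, hi = math.pi / N, math.pi
    return sum(v[i] * v[j] * t_entry(i - j, lo, hi) for i in range(L) for j in range(L))

def stopband_energy_direct(v, N, steps=4000):
    """Same quantity by Simpson integration of w**3 * |V(e^{jw})|**2 over [pi/N, pi]."""
    lo, hi = math.pi / N, math.pi
    dw = (hi - lo) / steps

    def f(w):
        re = sum(v[n] * math.cos(w * n) for n in range(len(v)))
        im = sum(v[n] * math.sin(w * n) for n in range(len(v)))
        return w ** 3 * (re * re + im * im)

    total = f(lo) + f(hi)
    for s in range(1, steps):
        total += (4 if s % 2 else 2) * f(lo + s * dw)
    return total * dw / 3

def autocorr_at_multiples(v, N):
    """Left-hand side of equation (18) for i = 0, 1, ... while the shift iN is inside the window."""
    L = len(v)
    return [sum(v[n] * v[n + i * N] for n in range(L - i * N)) for i in range((L + N - 1) // N)]

L, N = 8, 8
v = [math.sqrt(2 / L) * math.sin(math.pi * (n + 0.5) / L) for n in range(L)]  # assumed test window
E_quad = stopband_energy(v, N)
E_direct = stopband_energy_direct(v, N)
constraints = autocorr_at_multiples(v, N)
```

Because the test window has length N, the only constraint in equation (18) is the unit-energy one, which `constraints[0]` confirms.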

This is a perfectly valid design approach, but it can be numerically very demanding
and hard to implement for long windows. A simpler and faster approach to window
design is found in an article by the inventor herein, Z. Cvetkovic, "On Discrete
Short-Time Fourier Analysis," IEEE Transactions on Signal Processing, Vol. 48,
No. 9, September 2000, pp. 2628-2640, which is hereby incorporated by
reference, and is briefly described below. This simpler design approach
represents a window using a linear combination of discrete prolate spheroidal
sequences that are eigenvectors of matrices $S_L(\alpha)$, with matrix elements:

$$[S_L(\alpha)]_{i,j} = \frac{\sin((i - j)\alpha)}{i - j}, \qquad 1 \le i, j \le L, \quad \alpha = \pi/N. \tag{19}$$

Given a column vector $v$, the quadratic form $v^T S_L(\alpha) v$ gives the energy of $v$ in the
frequency band $(0, \pi/N)$. The eigenvectors $\rho_0, \rho_1, \ldots, \rho_{L-1}$ of $S_L(\alpha)$ are
orthogonal, and the corresponding eigenvalues $\lambda_0, \lambda_1, \ldots, \lambda_{L-1}$ are distinct and positive.
Sorting the eigenvectors so that $\lambda_i > \lambda_{i+1}$, a window to be designed is constructed
from a linear combination of the first $L/N + k + 1$ eigenvectors,

$$v = \sum_{i=0}^{L/N + k} a_i \rho_i, \tag{20}$$

where we take $k$ to be a number between 5 and 10.
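To illustrate the matrix of equation (19), the sketch below builds $S_L(\alpha)$ and extracts only its dominant eigenvector by power iteration, using the standard library alone. This is a simplification chosen for brevity: the expansion of equation (20) needs the first L/N + k + 1 eigenvectors, which would normally come from a full symmetric eigensolver. The sizes L = N = 8 and the function names are assumptions.

```python
import math

def s_matrix(L, alpha):
    """Matrix S_L(alpha) of equation (19); the diagonal uses the limit sin(d*alpha)/d -> alpha."""
    return [[alpha if i == j else math.sin((i - j) * alpha) / (i - j) for j in range(L)]
            for i in range(L)]

def dominant_eigenpair(S, iterations=300):
    """Power iteration: (largest eigenvalue, unit-norm eigenvector) of a symmetric matrix S."""
    L = len(S)
    p = [1.0 / math.sqrt(L)] * L
    for _ in range(iterations):
        q = [sum(S[i][j] * p[j] for j in range(L)) for i in range(L)]
        norm = math.sqrt(sum(t * t for t in q))
        p = [t / norm for t in q]
    lam = sum(p[i] * S[i][j] * p[j] for i in range(L) for j in range(L))
    return lam, p

L, N = 8, 8
S = s_matrix(L, math.pi / N)
lam0, rho0 = dominant_eigenpair(S)

# Residual of the eigen-equation S * rho0 = lam0 * rho0, as a sanity check.
residual = max(abs(sum(S[i][j] * rho0[j] for j in range(L)) - lam0 * rho0[i]) for i in range(L))
```

The resulting `rho0` is the length-L sequence with the largest value of the quadratic form $v^T S_L(\alpha) v$ among unit-norm vectors, which is why the leading eigenvectors are natural building blocks for a well-concentrated window.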
(18) translate into the following set of constraints in the expansion coefficients, 
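The constraints of equation
(18) translate into the following set of constraints in the expansion coefficients,

$$\sum_{l,m=0}^{L/N + k} a_l\, a_m\, \gamma^{(k)}_{l,m} = \delta[k], \qquad k = 1, 2, \ldots, L/N - 1, \tag{21}$$

where

$$\gamma^{(k)}_{l,m} = \sum_{n} \rho_l[n]\,\rho_m[n + kN]. \tag{22}$$

The design then amounts to finding the minimum of the quadratic form $a^T T_a a$
under the constraints of equation (21), where

$$a = [a_0 \; \cdots \; a_{L/N + k}]^T, \tag{23}$$

$T_a = r^T T r$, and $r$ is the matrix of the first $L/N + k + 1$ eigenvectors $\rho_i$,

$$r = [\rho_0 \; \rho_1 \; \cdots \; \rho_{L/N + k}]. \tag{24}$$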
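The passage from the time-domain constraints to constraints on the expansion coefficients is a direct substitution. As a sketch of that algebra, assuming real-valued eigenvectors and writing $l$ and $m$ for the summation indices over the expansion coefficients (the shift index $k$ below is the one that labels the constraints, not the constant $k$ in the upper summation limit):

```latex
\begin{align*}
\sum_n v[n]\,v[n+kN]
  &= \sum_n \Bigl(\sum_{l} a_l\,\rho_l[n]\Bigr)\Bigl(\sum_{m} a_m\,\rho_m[n+kN]\Bigr) \\
  &= \sum_{l}\sum_{m} a_l\,a_m \sum_n \rho_l[n]\,\rho_m[n+kN]
   = \sum_{l,m} a_l\,a_m\,\gamma^{(k)}_{l,m}, \\[4pt]
E_N = v^T T v
  &= (r a)^T\, T\, (r a) = a^T \bigl(r^T T r\bigr)\, a = a^T T_a\, a .
\end{align*}
```

The first two lines reproduce the left-hand side of the coefficient constraints, and the last line shows why the cost reduces to a quadratic form in the much shorter vector $a$.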

A transition window $v_{1,2}$ for joining a uniform section of an $N_1$-channel filter
bank based on a window $v_1$ at low frequencies with a uniform section of an
$N_2$-channel filter bank based on a window $v_2$ at high frequencies is designed by
approximating, as closely as possible, the frequency response of $v_1$ at negative
frequencies, and the frequency response of $v_2$ at positive frequencies. This
amounts to minimizing an energy function $E_{tr}$ given by

$$E_{tr} = \int_{-\pi}^{0} \left| \beta V_1(e^{j\omega}) - V_{1,2}(e^{j\omega}) \right|^2 d\omega + \int_{0}^{\pi} \left| V_2(e^{j\omega}) - V_{1,2}(e^{j\omega}) \right|^2 d\omega, \tag{25}$$

where $\beta = \sqrt{N_1/N_2}$, and carrying out the design process as discussed above.

It is noted that the choice of $\omega_0$ is not completely unconstrained if
one wishes the relationship $\omega_0 = 2\pi n_1/N_1$ to hold, where both $n_1$ and $N_1$ are integers;
this is particularly true when the value of $N_1$ is constrained by other design
preferences. This, however, is not an absolute limitation of the design approach
disclosed herein because as long as a gap is left between filter banks (like gap 40)
and a transition filter is designed that meets the requirements of equation (13),
near perfect reconstruction performance is attainable.
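The integer constraint on the break-over frequency can be explored by enumeration. The sketch below lists every pair of integers for which the two expressions for the break-over frequency coincide strictly between 0 and π, and then forms the channel index sets quoted in the description for the two uniform sections. The function names are hypothetical, and the values are those given for FIG. 3.

```python
import math

def breakover_choices(N1, N2):
    """All (n1, n2) with 2*pi*n1/N1 == 2*pi*n2/N2 and 0 < omega_0 < pi."""
    return [
        (n1, n1 * N2 // N1)
        for n1 in range(1, (N1 + 1) // 2)
        if (n1 * N2) % N1 == 0
    ]

def section_indices(N1, n1, N2, n2):
    """Channel indices retained from the N1-channel and N2-channel uniform banks."""
    low = list(range(0, n1)) + list(range(N1 - n1 + 1, N1))   # i = 0..n1-1 and N1-n1+1..N1-1
    high = list(range(n2 + 1, N2 - n2))                       # i = n2+1..N2-n2-1
    return low, high

N1, N2 = 16, 12                      # values used for FIG. 3
choices = breakover_choices(N1, N2)
n1, n2 = choices[0]
omega_0 = 2 * math.pi * n1 / N1
low, high = section_indices(N1, n1, N2, n2)
```

For these channel counts the only admissible pair is the one quoted for FIG. 3, which places the break-over frequency at π/2; other channel counts may admit several choices or none.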

The above disclosed an arrangement where the frequency band of interest
is divided into two bands, with each of the two bands being handled by one filter
bank, and with a single transition filter between the two filter banks. It should be
realized that this design approach could be easily extended to a non-uniform filter
bank that includes any number of uniform filter banks, with each uniform filter bank
segment having a lower cutoff frequency, $\omega^{i}_{lower}$, and an upper cutoff frequency,
$\omega^{i}_{upper}$, and the number of filters in the bank being dictated by the two cutoff
frequencies and the desired bandwidth, $2\pi/N_i$.

The above disclosed an approach for creating a filter bank that performs a
non-uniform decomposition of a signal having a given bandwidth, by means of an
example where two different filter bank sections are joined using a transition filter.
A more general non-uniform filter bank can be created by joining non-overlapping
sections of any number of uniform filter banks, using a plurality of transition filters.
Many variations can be incorporated by persons skilled in the art based on the
disclosed approach, without departing from the spirit and scope of this invention.
For example, assume that three bands are desired, with "break-over" frequencies
$\omega_0$ and $\omega_1$. A set of constants can be selected so that $2\pi n_1/N_1 = 2\pi n_{2,1}/N_2 = \omega_0$
and $2\pi n_{2,2}/N_2 = 2\pi n_3/N_3 = \omega_1$; filters $V_1$, $V_2$, $V_3$ can be selected, and filters
$V_{1,2}$ and $V_{2,3}$ can be designed, as described above.